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Abstract 

Cartoon- like images, i.e., C 2 functions which are smooth apart from a C 2 dis- 
continuity curve, have by now become a standard model for measuring sparse 
(non-linear) approximation properties of directional representation systems. 
It was already shown that curvelets, contourlets, as well as shearlets do ex- 
hibit (almost) optimally sparse approximations within this model. However, 
all those results are only applicable to band-limited generators, whereas, in 
particular, spatially compactly supported generators are of uttermost impor- 
tance for applications. 

In this paper, we now present the first complete proof of (almost) op- 
timally sparse approximations of cartoon-like images by using a particular 
class of directional representation systems, which indeed consists of com- 
pactly supported elements. This class will be chosen as a subset of shearlet 
frames - not necessarily required to be tight - with shearlet generators having 
compact support and satisfying some weak moment conditions. 

Keywords: Curvilinear discontinuities, edges, nonlinear approximation, 
optimal sparsity, shearlets, thresholding, wavelets 

1. Introduction 

In computer vision, edges were detected as those features governing an 
image while separating smooth regions in between. About 10 years ago, 
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mathematicians started to design models of images incorporating those find- 
ings aiming at designing representation systems which - in such a model - 
are capable of resolving edges in an optimally sparse way. However, custom- 
arily, at that time an image was viewed as an element of a compact subset of 
L p characterized by a given Besov regularity with the Kolmogorov entropy 
of such sets identifying lower bounds for the distortion rates of encoding- 
decoding pairs in this model. Although wavelets could be shown to behave 
optimally jij as an encoding methodology, Besov models are clearly deficient 
since edges are not adequately captured. This initiated the introduction of a 
different model, called cartoon-like model (see 19, El), which revealed the 
suboptimal treatment of edges by wavelets. 

The introduction of tight curvelet frames in 2004 by Candes and Donoho 
which provably provide (almost) optimally sparse approximations within 
such a cartoon-like model can be considered a milestone in applied harmonic 
analysis. One year later, contourlets were introduced by Do and Vetterli |H 
which similarly derived (almost) optimal approximation rates. In the same 
year, shearlets were developed by Labate, Weiss, and the authors in [l7J as 
the first directional representation system with allows a unified treatment 
of the continuum and digital world similar to wavelets, while also providing 
(almost) optimally sparse approximations within such a cartoon-like model 

In most applications, spatial localization of the analyzing elements of an 
encoding system is of uttermost importance both for a precise detection of 
geometric features as well as for a fast decomposition algorithm. However, 
none of the previously mentioned results cover this situation. In fact, the 
proofs which were provided do by no means extend to this crucial setting. 

In this paper, we now present the first complete proof of (almost) op- 
timally sparse approximations of cartoon-like images by using a particular 
class of directional representation systems, which indeed consist of compactly 
supported elements. This class will be chosen as a subset of shearlet frames 
- not necessarily required to be tight - with shearlet generators having com- 
pact support and satisfying some weak moment conditions. Interestingly, 
our proof is very different from all previous ones caused by the extensive 
exploration of the compact support of the shearlet generators and the lack 
of directional vanishing moments. 
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1.1. A Suitable Model for Images: Cartoon- Like Images 

Intuitively, cartoons are smooth image parts separated from other areas 
by an edge. After a series of initial models ji], [l{|, the first complete model 
of cartoons has been introduced in [l|], and this is what we intend to use also 
here. The basic idea is to choose a closed boundary curve and then fill the 
interior and exterior part with C 2 functions (see Figured]). 




Figure 1: Example of a cartoon- like image. 

Let us now be more precise, and introduce STAR 2 {y), a class of indicator 
functions of sets B with C 2 boundaries dB and curvature bounded by u, as 
well as S 2 (u), a class of cartoon-like images. For this, in polar coordinates, 
we let p{9) — y [0, 1] be a radius function and define the set B by 

B = {x G M 2 : \x\ < p{6), x = (\x\,6) in polar coordinates}. 

In particular, the boundary dB of B is given by the curve 

and the class of boundaries of interest to us are defined by 

sup |p" (0)| < v, p< Po <l. (2) 

The following definition now introduces the notions STAR 2 {u) and S 2 (u) 
from [if]. 

Definition 1.1. For v > 0, the set STAR 2 (i>) is defined to be the set of all 
Be [0, l] 2 such that B is a translate of a set obeying ([I]) and (J2J). Further, 
£ 2 {v) denotes the set of functions f on M 2 with compact support in [0, l] 2 of 
the form 

f = fo + flXB, 

where fo, fi G C 2 (IR 2 ) with compact support in [0, l] 2 , B G STAR 2 (v), and 

C 2 =E H < 2 P a /Hoo<l. 
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1.2. Optimal Sparsity of a Directional Representation System 

The 'quality' of the performance of a (directional) representation system 
with respect to cartoon-like images is typically measured by taking a non- 
linear approximation viewpoint. More precisely, given a cartoon-like image 
/ G S 2 (u) and a (directional) representation system (erj)i 6 j which forms an 
orthonormal basis, the chosen measure is the asymptotic behavior of the best 
iV-term (non-linear) approximation error in L 2 norm in the number of terms 
N, i.e., 

Wf-fN\\ 2 2=\\f-Y,(f' a >* 

iei N 

where ((/, <7j)) ieJjv are the N largest coefficients (/, o"j) in magnitude. Wavelet 
bases exhibit the approximation rate 

\\f - f N \\l<C ■ N' 1 asiV^oo. 



as iV — > oo, 



However, Donoho proved in [6[ that the optimal rate which can be achieved 
under some restrictions on the representation system as well as on the selec- 
tion procedure of the approximating coefficients is 

\\f-f N \\l<C-N- 2 asiV^oo. 

It was a breakthrough in 2004, when Candes and Donoho introduced the 
tight curvelet frame in [l[ and proved that this system indeed does satisfy 

11/ - /ivlll < C ■ AT" 2 - (logiV) 3 asiV^oo, 

where again the approximation was generated by the N largest coeffi- 
cients in magnitude. Although the optimal rate is not completely achieved, 
the log-factor is typically considered negligible compared to the A r_2 -factor, 
wherefore the term 'almost optimal' has been adopted into the language. 
This result is even more surprising taking into account that in case of a tight 
frame the approximation by the N largest coefficients in magnitude does not 
even always yield the best N-term approximation. 

1.3. (Compactly Supported) Shearlet Systems 

The directional representation system of shearlets has recently emerged 
and rapidly gained attention due to the fact that - in contrast to other pro- 
posed directional representation systems - shearlets provide a unified treat- 
ment of the continuum and digital world similar to wavelets. We refer to, 



4 



10. 71 : 



15 1 for the continuum theory, (l6l . 0, [l8[ for the digital theory, and 
10l . l7j for recent applications. 

Shearlets are scaled according to a parabolic scaling law encoded in the 
parabolic scaling matrices A 2 j or A 2 j, j G Z, and exhibit directionality by 
parameterizing slope encoded in the shear matrices Sk, k G Z, defined by 
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respectively. 

To ensure an (almost) equal treatment of the different slopes, which is 
evidently of significant importance for practical applications, we partition 
the frequency plane into the following four cones C\ - C±: 



{(6,6) G 

{(6,6) g 
{(6,6) g I 
{(6,6) el 



p2 . 



:6>i, 16/61 <i} 

M 2 : 6 > 1, 16/61 < 1} 
2 :6<-i, 16/61 <i} 
2 : 6 < -i, 16/61 < 1} 



L=l, 
L = 2, 
L = 3, 

t = 4, 



and a centered rectangle 



^ = {(6,6 



:||(6,6)IU<1}. 



For an illustration, we refer to Figure |2(a). 

The rectangle 7?. corresponds to the low frequency content of a signal 
and is customarily represented by translations of some scaling function. 
Anisotropy comes into play when encoding the high frequency content of 
a signal which corresponds to the cones C\ - C&, where the cones C\ and C3 
as well as C 2 and C4 are treated separately as can be seen in the following 

Definition 1.2. For some sampling constant c > 0, the (cone-adapted) 
shearlet system S1-L(c; (j), xf), ij)) generated by a scaling function <p G L 2 (IR 2 ) 
and shearlets ip, ip G L 2 (M 2 ) is defined by 

SU(c; <P, i/;, t/0 = $(c; 0) U #(c; V) U tt(c; 
$(c; 0) = {0 m = 0(- -cm):™e Z 2 }, 
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(a) (b) 

Figure 2: (a) The cones C\ - C4 and the centered rectangle TZ in frequency domain, (b) 
The tiling of the frequency domain induced by a (cone-adapted) shearlet system. 

*(c;V») = {^ jAm = 2 3 ^^(S k A 23 ■ -cm) : j > 0, < \2^ 2 ],m G Z 2 }, 
and 

§(c;$ = {$ jjk>rn = 2^l^(SlA 2J ■ -cm) : j > 0, < \^%m G Z 2 }. 

The reader should keep in mind that although not indicated by the nota- 
tion, the functions m , i>j^,mi an d ^j,k,m all depend on the sampling constant 
c. For the sake of brevity, we will often write ip\ and where A = (j, k, m) 
index scale, shear, and position. For later use, we further let Aj be the 
indexing sets of shearlets in \l/(c; ip) and \I/(c; ip) at scale j, respectively, i.e., 

*(c;^) = {^:A6A i ,j = 1 ...,oo} 

and 

*(c;^) = {i> x : A G k h j = 0, . . . , 00}. 

Finally, we define 

00 

A=|J A r 

j=0 

The tiling of frequency domain induced by iS"H(c; 0, ip, ip) is illustrated 
in Figure EJ^b). From this illustration, the anisotropic footprints of shearlets 
contained in ty(c;ip) and ty(c;ip) can clearly be seen. However, the reader 
should notice that the tiling indicated here is based on the essential support 
and not the exact support of the analyzing elements, since our focus will be on 
shearlet systems associated with spatially compactly supported generators. 
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The corresponding anisotropic footprints of shearlets in spatial domain are 
of size 2~i/ 2 times 2~K A beautiful intuitive extensive explanation of why 
it is conceivable that such a system - based on parabolic scaling - exhibits 
optimal sparse approximation of cartoon-like images, is provided in and 
we would like to refer the reader to this paper. The main idea is to count 
the number of shearlets intersecting the discontinuity curves, which is 'small' 
compared to the number of such wavelets, due to their anisotropic footprints. 

Certainly, we naturally ask the question when S7i(c; <p, ip, i/j) does form a 
frame for L 2 (IR 2 ). The wavelet literature provides various necessary and suf- 
ficient conditions for $(c; 0) to form a frame for L 2 ({f 6 L 2 (M 2 ) : supp(/) C 
7Z}), also when is compactly supported in spatial domain. Although not 
that well-studied as wavelets yet, several answers are also known for the 
question when \l/(c; ip) forms a frame for 

L 2 ({/GL 2 (M 2 ):supp(/)CCiUC 3 }), 

and we refer to results in 0, Q 3 O- Since (c; <j>) and tf(c; are linked 
by a simple rotation of 90°, these results immediately provide conditions for 
\&(c; tp) to constitute a frame for 

L 2 ({/GL 2 (M 2 ):supp(/)CC2UC 4 }). 



Very recent results in [12j even focus specifically on the case of spatially 
compactly supported shearlets - of uttermost importance for applications 



due to their superior localization. For instance, in [12[, the following special 
class of compactly supported shearlet frames for L 2 (M. 2 ) was constructed: 
The generating shearlets ip and ip were chosen separable, i.e., of the form 
ipi(xi) ■ ^2(^2) an d 4>i{x2) ■ ^2(^1), respectively, where ipi is a wavelet and 
ip2 is a scaling function both associated with some carefully chosen low pass 
filter. Intriguingly, our main result in this paper (Theorem II. 3p proves as 
a corollary that this class of compactly supported shearlet frames provides 
(almost) optimally sparse approximations of cartoon-like images. We refer 
to [l2| for the precise statement. 

Combining those thoughts, we can attest that frame properties of the sys- 
tem ST-L(c; cj),ip,ip) including spatially compactly supported generators are 
already quite well studied. We however wish to mention that there is a 
trade-off between compact support of the shearlet generators, tightness of the 
associated frame, and separability of the shearlet generators. The known con- 
structions of tight shearlet frames do not use separable generators, and these 
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constructions can be shown to not be applicable to compactly supported 
generators. Tightness is difficult to obtain while allowing for compactly 
supported generators, but we can gain separability, hence fast algorithmic 
realizations. On the other hand, when allowing non-compactly supported 
generators, tightness is possible, but separability seems to be out of reach, 
which makes fast algorithmic realizations very difficult. 

I. 4- Optimally Sparse Approximation of Cartoon-Like Images by Shearlets 

The concept of optimally sparse approximation of cartoon-like images of 
general (directional) representation systems was already discussed in Section 

II. 21 However, the attentive reader will have realized that only the situa- 
tion of tight frames was studied whereas here we need to consider sparse 
approximations by arbitrary frames. Hence this situation deserves a careful 
commenting. 

Let ST-L(c;(j),ip,ip) be a shearlet frame for L 2 (IR 2 ), which for illustrative 
purposes for a moment we denote by ST-L(c; 0, ip, ijj) = (crj) ie /, say. Is it well- 
known that a frame is associated with a canonical dual frame, which in this 
case we want to call (<5i)j 6 /. Then we define the iV-term approximation /jv 
of a cartoon-like image / G S 2 (u) by the frame <S"H(c; 0, -0, ip) to be 



where ((f,ai))i e i N are the N largest coefficients (f,o~i) in magnitude. As in 
the tight frame case, this procedure does not always yield the best iV-term 
approximation, but surprisingly even with this 'crude' selection procedure - 
in the situation of spatially compactly supported generators - we can prove 
an (almost) optimally sparse approximation rate as our main result shows. 

Theorem 1.3. Let c > 0, and let (f),ifj,ifj G L 2 (M 2 ) be compactly supported. 
Suppose that, in addition, for all £ = (^1,^2) £ ^ 2 > the shearlet if; satisfies 




(i) |^(0I < Ci • mini 1 , 161°) ■ min(l, ^il" 7 ) • min(l, l^l" 7 ) and 



(ii) 



d 

96 



m\<m l )\-(\ + 



) 
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where a > 5, 7 > 4, h G L 1 (R) ; and C\ is a constant, and suppose that the 
shearlet ip satisfies (i) and (ii) with the roles of ^1 and £2 reversed. Further, 
suppose that S7-L(c; <f>, ip, ip) forms a frame for L 2 (M. 2 ). 

Then, for any v > 0, the shearlet frame S7i(c; (f>, 1/}, ijj) provides (almost) 
optimally sparse approximations of functions f G £ 2 (v), ie. ; there exists 
some C > such that 

11/ - f N \\l <C-N~ 2 - (log iV) 3 as N -+ 00, 

where f^ is the nonlinear N-term approximation obtained by choosing the N 
largest shearlet coefficients of f . 

Condition (i) can be interpreted as both a condition ensuring (almost) 
separable behavior as well as a first order moment condition along the hor- 
izontal axis, hence enforcing directional selectivity. This condition ensures 
that the support of shearlets in frequency domain is essentially of the form 
indicated in Figure [2(b). Condition (ii) (together with (i)) is a weak version 
of a directional vanishing moment conditionzl, which is crucial for having fast 
decay of the shearlet coefficients when the corresponding shearlet intersects 
the discontinuity curve. Conditions (i) and (ii) are rather mild conditions 
on the generators. To compare with the optimality result for band-limited 
generators we wish to point out that conditions (i) and (ii) are obviously 
satisfied for band-limited generators. 

Notice also that, intriguingly, the - the 'true' optimality destroying - log- 
factor has the same exponent as in the curvelet-, contourlet-, and shearlet- 
result on (almost) optimally sparse approximation. 

1.5. Prior Work and Our Contribution 

In 2004, Candes and Donoho [l| achieved a breakthrough when introduc- 
ing tight curvelet frames, which provide (almost) optimally sparse approx- 
imations of cartoon-like images (functions in £ 2 (z/)). The main outline of 
their proof is to break [0, l] 2 into smaller cubes and then separately analyze 
the curvelet coefficients essentially centered in the smooth part of the model 
and those essentially centered on the discontinuity curve. For both sets of co- 
efficients their weak-£ 2 /3 norm is estimated; the estimate for the 'non-smooth 
part' also requiring the usage of the Radon transform. 



2 For the precise definition of directional vanishing moments, we refer to 
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A year later, Do and Vetterli [4j introduced contourlets and proved similar 
sparsity results for those. However, although their work includes contourlets 
with compact support, their construction is fully based on discrete filter 
banks so that directional selectivity is problematic. Because of this fact, 
infinite directional vanishing moments had to be artificially imposed in order 
to achieve (almost) optimal sparsity. However, this is impossible for any 
function with compact support to satisfy. Hence, similar to curvelets, optimal 
sparsity is only proven for band-limited contourlets. 

In 2005, shearlets were introduced as the first directional representation 
system ensuring a unified treatment of the continuum and digital world by 



Labate, Weiss, and the authors in [17|. One year later, Labate and Guo 



proved (almost) optimally sparse approximations of cartoon-like images for 



the at that time customarily utilized shearlet frames which are band- 
limited such as curvelets. The proof the authors provided follows the proof 
in [l[ very closely step by step. 

Concluding, although those pioneering studies deserve all our credit, these 
results are far from including the important class of directional representation 
systems consisting of compactly supported functions. 

The main contribution of this paper is to provide the first complete proof 
of (almost) optimally sparse approximations of cartoon-like images using a 
directional representation system consisting of compactly supported func- 
tions. Our proof is indeed very different from all previous ones caused by 
the necessary extensive exploration of the compact support of the shearlet 
generators, the only similarity being the breaking of [0, l] 2 into smaller cubes 
and the separate consideration of shearlet coefficients now being exactly con- 
tained - in contrast to being essentially contained for all other systems - in 
the smooth part and those which intersect the discontinuity curve. Previous 
results all require moment conditions along the direction of the discontinuity 
curve - thereby requiring vanishing moments along infinitely many directions 
asymptotically in scale -, which is trivially satisfied for band-limited gener- 
ators. Intriguingly, a weaker version of directional vanishing moments, even 
only in one direction and the shearing taking care of the remaining directions, 
is sufficient for our analysis. 

1.6. Outline 

In Section [21 we present the overall structure of the proof, the results of 
the analysis of shearlet coefficients being contained in the smooth part and 
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those which intersect the discontinuity curve, and - based on these results - 
state the proof of Theorem 11.31 The proofs of the results on the behavior of 
shearlet coefficients in the smooth and non-smooth part are then carried out 
in Sections [3] and HI respectively. 

2. Architecture of the Proof of Theorem 11.31 

We now detail the general structure of the proof of Theorem 11.31 starting 
by introducing useful notions and explaining the blocking into smaller boxes 
and splitting into the smooth and non-smooth part. Then the main results 
concerning the analysis of shearlet coefficients being entirely contained in the 
smooth part and those intersecting the discontinuity curve will be presented 
followed by the proof of Theorem 11.31 based on those. 

2.1. General Organization 

Let now ST-L(c] <f>,if),if)) satisfy the hypotheses of Theorem II .3} and let 
/ G £ 2 {v)- Further, we let A denote the lower frame bound of SH(c; <fi, if), if)). 

We first observe that, without loss of generality, we might assume the 
scaling index j to be sufficiently large, since / as well as all frame elements in 
the shearlet frame SH(c; (f>, if), if)) are compactly supported in spatial domain, 
hence a finite number does not contribute to the asymptotic estimate we aim 
for. In particular, this means that we do not need to consider frame elements 
from $(c; (ft). Also, we are allowed to restrict our analysis to shearlets if)j t k,m, 
since the frame elements if>j,k,m can be handled in a similar way. 

Our main concern will be to derive appropriate estimates for the shearlet 
coefficients {(f,ip\) : A e A} of /. Letting \9(f)\ n denote the nth largest 
shearlet coefficient (/, if)\) in absolute value and exploring the frame property 
of SH{c; (f),if),if)), we conclude that 

II/-/*II2<^£ 

n>N 

for any positive integer N. Thus, for the proof of Theorem 11.31 it suffices to 
show that 

J]|^<C-iV- 2 -(logiV) 3 asiV^oo. (3) 

n>N 

To derive the anticipated estimate in Q, for any shearlet if)\, we will 
study two separate cases: 
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• Case 1. The compact support of the shearlet ip\ does not intersect the 
boundary of the set B, i.e., int(supp(^A)) H dB = 0. 

• Case 2. The compact support of the shearlet ip\ does intersect the 
boundary of the set B, i.e., int(supp(^\)) H 95 7^ 0. 

Notice that this exact distinction is only possible due to the spatial compact 
support of all shearlets in the shearlet frame. 

In the sequel - since we are concerned with an asymptotic estimate - for 
simplicity we will often simply use C as a constant although it might differ 
for each estimate. Also all the results in the sequel are independent on the 
sampling constant c > 0, wherefore we now fix it once and for all. 

2.2. The Smooth and the Non-Smooth Part of a Cartoon-Like Image 

To illustrate which conditions on ip required by Theorem 11.31 are utilized 

for the decay estimates of the different cases, in this section we do not make 

any initial assumptions on ip. 

Let us start with the smooth part, which is the easier one to handle. 

Dealing with this part allows us to consider some g G C 2 (IR 2 ) with compact 

support in [0, l] 2 and estimate its shearlet coefficients. This is done in the 

following proposition. Notice that the hypothesis on ip of the following result 

is implied by condition (i) in Theorem 11.31 

Proposition 2.1. Let g G C 2 (1R 2 ) with compact support in [0,1] 2 , and let 
ip G L 2 (R 2 ) be compactly supported and satisfy 

\m\ < Crmin(l, |6P)-min(l, |^p).min(l, |6D for all £ = G R 2 

where 7 > 3, a > 7 + 2, and C\ is a constant. Then, there exists some C > 
such that 

J2\0(9)\l<C-N- 2 as 00. 

n>N 

Thus, in this case, optimal sparsity is achieved. The proof of this propo- 
sition is given in Section El 

Next, we turn our attention to the non-smooth part, in particular, to 
estimating those shearlet coefficients whose spatial support intersects the 
discontinuity curve. For this, we first need to introduce some new notations. 
For any scale j > and any grid point p G Z 2 , we let Qj tP denote the dyadic 
cube defined by 

Q jtP = [-2-^/2, 2-^] 2 + 2-^V 
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Further, let Qj be the collection of those dyadic cubes Qj >p whose interior, 
in the following denoted by int(Q JiP ), intersects dB, i.e., 

Qj = {Qj, P : mt(Q^) n <95 ^ 0,p g Z 2 }. 

Of interest to us is also the set of shearlet indices, which are associated with 
shearlets intersecting the discontinuity curve inside some Qj tP G Qj, i.e., for 
j > and p G Z 2 with Qj jP G Qj, we will consider the index set 

Am, = {A G Aj : int(supp(^ A )) PI int(Q i)P ) D dB ^ 0}. 

Finally, for j > 0, p G Z 2 , and < e < 1, we define Aj iP (e) to be the index 
set of shearlets V'a, A G A JjP , such that the magnitude of the corresponding 
shearlet coefficient (f,ip\) is larger than e and the support of ip\ intersects 
Qj P at the jth scale, i.e., 

A j>p {e) = {A G A j)P : \(fM\>e}, 

and we define A(e) to be the index set for shearlets so that \{f,ijj\)\ > £ 
across all scales j, i.e., 

A( £ )=U A ^( £ )- 

j,p 

The expert reader will have noticed that in contrast to the proofs in [l[ and 



111 ], which also split the domain into smaller scale boxes, we do not apply 
a weight function to obtain a smooth partition of unity. In our case, this is 
not necessary due to the spatial compact support of the frame elements. 

As mentioned at the beginning of this section, we may assume that j is 
sufficiently large. Given some scale j > and position p G Z 2 for which the 
associated cube Qj tP satisfies Qj tP G Qj. Then the set 

S hP = U su PP(^a) 

is contained in a cubic window of size C-2^l 2 by C-2~H 2 , hence is of asymp- 
totically the same size as Qj, p - By smoothness assumption on the disconti- 
nuity curve dB, the edge curve can be parameterized by either (xi,E(xi)) 
or (x2, E{x2)) with E G C 2 in the interior of Sj tP for sufficiently large j\ 
Thus, we are facing the following two cases (see also Figure [3]): 



3 In other words, a part of the edge curve dB contained in <Sj )P can be described as a 
C 2 function x% = E{x2) ( or X2 = E(x±)). 
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• Case 2a. The edge curve dB can be parameterized by either (£"(0:2), x 2 ) 
or (xi, E(x\)) with E G C 2 in the interior of <Sj jP such that, for any 
A G Aj iP , there exists some x = (x\,x 2 ) G int(Qj iP )nint(supp(V>;0)n<9-B 
satisfying either |£"(x2)| < 2 or < 2. 

• Case 2b. The edge curve (95 can be parameterized by either (E(x 2 ), x 2 ) 
or (xi, E(xi)) with E G C 2 in the interior of iSj iP such that, for any 
A G Aj iP , there exists some x = (x\,x 2 ) G int(Q JiP ) flint (supp(^a)) C\dB 
satisfying either \E\x2)\ > 2 or |£"(a;i)| _1 > 2. Here, we identify 
E'{xi) = with lE'ixi)]' 1 = oo>2. 




(a) (b) 



Figure 3: (a) A part of the curve dB satisfying Case 2a. (b) A part of the curve dB 
satisfying Case 2b. 

For both cases, we will derive the below stated upper estimates 01]) and (jSJ) 
for the absolute value of the associated shearlet coefficients. The proofs of 
these estimates are contained in Section HJ 

Proposition 2.2. Letip G L 2 (M?) be compactly supported, and assume thatip 
satisfies conditions (i) and (ii) of Theorem \1.3l Further, let j > andp G Z 2 ; 
and let A G A JiP . For fixed x = (xi, x 2 ) G int(Qj iP ) fl int(supp(tp\)) fl dB, let s 
be the slop^ of the tangent to the edge curve dB at (xi,x 2 ), more precisely, 

(i) if dB is parameterized by (E(x 2 ),x 2 ) with E G C 2 in the interior of 
Sj )P , then s = E'(x 2 ), 

(ii) if dB is parameterized by (x\, E(xi)) with E G C 2 and E'(x\) ^ in 
the interior of Sj tP , then s = (-E'(xi)) -1 , and 



4 Notice that here we regard the slope of the tangent to a curve (E(x2), X2), i.e., we 
consider s of a curve indexed by the X2-axis, for instance, by x\ = sxi + b. For analyzing 
shearlets "0j,fc,m, the roles of X\ and X2 would need to be reversed. 
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(iii) if dB is parameterized by (xi, E(x\)) with E £ C 2 and E'(xi) = in 
the interior of Sj, p , then s = oo. 

Then there exists some C > such that 

\(fM\<C- *~* if\s\<3. (4) 



and 



\{fM\<C-2~y, if\s\ >^ors = oo, (5) 



Notice that in Case 2a, condition (i) or (ii) can occur, whereas in Case 
2b, all three conditions can occur. 

2. 3. Proof of Theorem [Ql 

Let / £ S 2 (u). We first observe that, by Proposition 12. 1[ we can neglect 
those shearlet coefficients whose spatial support of the associated shearlet 
does not intersect the discontinuity curve. 

To estimate the remaining shearlet coefficients, we need to analyze their 
decay properties. For this, let j > be sufficiently large and let p £ Z 2 , 
be such that the associated cube satisfies Qj >p £ Qj. We note that all sets 
int(supp( , 0A)) with A £ Aj tP are contained in the interior of Sj >p . Therefore 
weights as in [l[ and 11] are not required here. 



Letting e > 0, our goal will now be to estimate first |Aj iP (e:)| and then 
|A(e)|. WLOG we might assume H^lli < 1, which implies 

|(/,^a)|<2^/ 4 . 

Hence, for estimating |Aj )P (e)|, it is sufficient to restrict our attention to 
scales 

i^^iog^- 1 ). (6) 

We will now deal with Case 2a and Case 2b separately. 

Case 2a. First, we let s be the slope of the tangent to the edge curve 
dB at (£i,X2) as defined in Proposition 12.21 i.e., if OB is parameterized by 
(Efaz), X2) in the interior of Sj tP , then s = £"(£2), and if dB is parameterized 
by (xi, E(xi)) in the interior of Sj tP , then s = ^"(xi)" 1 . 

By assumption of Case 2a, we have s £ [—2,2]. Now observe that, for 
each shear index k, 

|{A = (j, k,m):Xe A jjP }\ < C ■ (\k + y/ 2 s\ + 1). (7) 
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Interestingly, this estimate is independent of the choice of the point x G 
int(<2j >p ) fl int(supp(^)) H dB, which can be seen as follows: Let x G 
int(Qj iP ) fl int(supp(^A)) H <9-B, £' G int(Q JiP ) D int(supp(^A')) H and let 
s and s' be the associated slopes of the tangents to the edge curve E in x 
and respectively. Since E G C 2 , 

\s-s'\ <Cf2-i/ 2 , (8) 

and hence 

|fc + 2^V| < C ■ {\k + 2 J / 2 s\ + 1). 

This proves that the estimate ((?]) remains the same asymptotically, indepen- 
dent of the values of s and s', and hence of x and x'. 

We further require the following even stronger observation: Fix some 
x G int(Qj iP ) H int(supp(^A)) H <9£>, and let s be the associated slope of the 
tangent to the edge curve E in x. Now consider another x' G int(Q JiP ) fl 
int(supp(^A')) H 95, and again let s' be the associated slope of the tangent 
to the edge curve E in x' . Then, for sufficiently large scaling index j, by 
OH]), \s — s'\ is sufficiently small and we may assume s' G [—3,3]. Hence the 
estimate (j4j) from Proposition 12.21 holds not only for x (with s), but also for 
x' (with s'). In fact, it can even be checked that by substituting s' by s in (j4j) 
the asymptotic behavior of the estimate for |A JiP (e)| does not change. Let us 
briefly outline the reasoning. First, we observe that WLOG we can assume 
that \k + 2 J / 2 s| > 2 ■ C*i, where C\ is the constant appearing in (jSj), since 
|{fc G Z : + 2 J / 2 s| < 2 • Ci}| < C with C being independent on j and hence 
it can be deduced that the hypothesis \k + 2^ 2 s\ > 2 ■ C\ for each j > does 
not affect our asymptotic estimate of |A J)P (e)|. From (jSJ), it then follows that 

\k + 2 j/2 s\ < 2 - |A; + 2 J ' /2 s / |, 

which in turn implies 

\k + 2i/ 2 s'\ 3 ~ 8 \k + 2^ 2 s\ 3 ' 

Hence, by substituting s' by s in (jlj) the asymptotic behavior of the estimate 
for |Aj jP (e)| does not change. Thus, it suffices to consider just one fixed 
x G mt(Qj tP ) fl int(supp(V'A)) H (9-B with associated slope s in each Qj tP . We 
now turn to estimating |Aj ]P (e)| in this case. 
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For this, by estimate fllj from Proposition 12.21 \(fii J \)\ > e implies 

\k + 2 j ' 2 s\ < C • e" 1/3 • 2^'/ 4 . (9) 
From ([7]) and ([H]), we then conclude 

|A itP (e)|<C- ^ (|A; + 2^ 2 s | + l) < C • (^ 1/3 • 2"^ 4 + l) 2 , (10) 

where K^e) = {keZ:\k + 2^ 2 s\ < C ■ e~^ 3 ■ 2^/ 4 }. 

Case 2b. Exploiting similar arguments as in Case 2a, it also suffices to 
consider just one fixed x G int(<2j P ) R int(supp(?/>A)) H 95 with associated 
slope s in each inl^Q^). Again, our goal is now to estimate |Aj iP (e)|. 

By estimate © from Proposition \2.2\ \(f,ip\) \ > e implies 

C ■ 2~^ > e, 

hence 

and j<~\og 2 {e- l ) + C. (11) 
Since there exists some C with 

|A,g <c-v, 

it then follows that 

|A ilP (e)|<C7.^. (12) 

Notice that this last estimate is exceptionally crude, but will be sufficient for 
our purposes. 

We now combine the estimates for |Aj )P (e)| derived in Case 2a and Case 
2b. Since 

#(Qi) <C-Ti\ 

by (HDD (and ©) and by (P} (and (02}), we have 



|A(e)| < C-[ 23/2 ( £ " 1/3 - 2_1/4 " + 1 ) + E 2§J 

i=o j=0 
< C-e-W-htoie- 1 ). (13) 
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Having estimated |A(e)|, we are now ready to prove our main claim. For 
this, set N = |A(e)|, i.e., the total number of shearlets ip\ such that the 
magnitude of the corresponding shearlet coefficient (/, ip\) is larger than e. 
By f|T3|) . the value e can be written as a function of the total number of 
coefficients N in the following way: 

e(N) < C ■ N~ 3/2 ■ (log iV) 3/2 , for sufficiently large N > 0. 

This implies that 

|^(/)|v<C-iV- 3 / 2 -(logiV) 3 / 2 . 

Hence, 

E \ 9 U)\l <C-N~ 2 ■ (log iV) 3 for sufficiently large N > 0, 

n>N 

which proves (JH])- The proof of Theorem 11.31 is finished. 



3. Analysis of Shearlet Coefficients associated with the Smooth 
Part of a Cartoon-Like Image 

In this section, we will prove Proposition 12.11 For this, we first prove a 
result which shows that, provided that the shearlet ip satisfies certain decay 
conditions even with strong weights such as (2 4j ')j, the system \l/(c; ip) forms 
a Bessel-like sequence for C 2 (IR 2 ) with compact support in [0, l] 2 . 

In the following we will use the notation rj ~ Sj for rj, Sj G R, if C\ ■ rj < 
s j < C*2 • r j with constants C\ and C-i independent on the scale j. 

Lemma 3.1. Let ip G L 2 (W. 2 ) satisfy 

|^(0| < C r min(l, |eir)-min(l, |ei|- 7 )-min(l, l^l" 7 ) for all £ = G R 2 

where 7 > 3, a > 7 + 2, and C\ is some constant. Then, there exists some 
C > such that, for all g G C 2 (M?) with compact support in [0, 1] 2 ; 



E E E- ; ' ^ '•>•"<) 

3=0 \k\<l23/ 2 ~\ m6Z 2 



< C 



d 2 



dx{' 



The proof of this lemma will explore the following result from [I2j , which 
we state here for the convenience of the reader. 
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Proposition 3.2. flM] Let i/> G L 2 (R 2 ) satisfy 

|^(0| < C r min(l, |eir)-min(l, |6n-min(l, |£ 2 P) /or all £ = e 

where a > 7 > 3 and Ci some constant. Then, there exists some C > 
swc/i i/iat, /or a// 77 G L 2 (R 2 ), 



E E Ei^^-)i 2 ^ c 

3=0 \k\<\2j/ 2 ] m&? 



IK/112- 



Proof. (Proof of Lemma \S. By the assumption on ip, the parameters a 
and 7 can be chosen such that 

|^(£)| < C r min(l, |6| Q )-min(l, |£ir)-min(l, |£ 2 p) for all £ = G M 2 

where a > 7 + 2, 7 > 3. Now, let 77 G L 2 (M 2 ) be chosen to satisfy 

d 2 



7] = 1p. 



Then a straightforward computation shows that 77 satisfies the hypotheses of 
Proposition 13.21 Using integration by parts, 



hence, by Proposition 13.21 



9i Vj,k,m 



E E E- ,u om->. 

3=0 \k\<\2i/ 2 ^ m&& 



2^\(g,^ k , m )\- 



E E E 

3=0 |fe|<[2J/ 2 ] raeZ 2 



<9x 2 



AS Vj,k,m 



< c- 



& 1 



dx\' 



The proof is complete. 

This now enables us to derive Proposition 12.11 as a corollary. 
Proof. (Proof of Proposition \2. Set 

Aj = {A G Aj : supp(^ A ) n supp(o) ^ 0}, j > 0, 



□ 
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i.e., Aj is the set of indices in Aj associated with shearlets whose support 
intersects the support of g. Then, for each J > 0, we have 



j-i 

j = 



|jA i |~2 2J . (14) 

3=0 



Now, first observe that there exists some C > such that 

oo oo oo 

E2 4j Ei%)i' < cj2J2J2 24j \(9,^, k ,m)\ 2 

j=l n>Nj j=l j'=j k,m 

oo / j' 

j'—l k.m \i=l 



By Lemma [3.11 this implies 



E 24J E i%)i' <c-EE 24/ i^^"^)i 2<o ° 

j=l n>Nj j'=l k,m 



and hence, also by (FT4T) . 



E |%)|^<c-(2^)- 2 <c-iv; 



-2 

ri>Af,- 



Finally, let N > 0. Then there exists a positive integer j > satisfying 

N ~ % ~ 2 2jo , 

and the claim is proved. □ 



4. Analysis of Shearlet Coefficients associated with the Disconti- 
nuity Curve 

4-1. Proof of Proposition \2.2\ 

Let (j, k, m) G Aj tP , and fix x = (xi, x 2 ) G int(Q J)P ) D int(supp(^)) PI 95. 
Let s be the slope of the tangent to the edge curve dB at (xi, x 2 ) as defined 
in Proposition 12.21 i.e., if dB is parameterized by (E(x2),X2) in the interior 
of Sj tP , then s = E'(x 2 ), and if dB is parameterized by (xi, E(x\)) in the 
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interior of Sj tP , then s = -E"(xi) _1 , where we now assume that E'{x\) 7^ 
and consider the case E'(xi) = later. 

By translation symmetry, WLOG we can assume that the edge curve 
satisfies E(0) = with (xi,x 2 ) = (0,0). Further, since the conditions (i) 
and (ii) in Theorem 11.31 are independent on the translation parameter m, 
it does not play a role in our analysis. Hence, WLOG we choose m — 0. 
Also, since i/j is compactly supported, there exists some L > such that 
supp(^) C [— L/2,L/2] 2 . By a rescaling argument, we can might assume 
L = 1. Even more, WLOG we can assume that supp(^) = [—1/2, 1/2] 2 , 
which implies 

supp(^,fc,o) = {iel 2 : -2' j/2 kx 2 - 2~ j ~ l < x x < -2~ j/2 kx 2 + 2~ i ~ 1 , 
-2~2 < 2x 2 < 2"2}, (15) 



since this will not change our asymptotic estimate for |Aj iP (e)|. 

Let now / £ £ 2 {v), and select V to be the smallest parallelogram which 
entirely contains the discontinuity curve parameterized by {E{x%), x 2 ) or 
(xi,E(xi)) in the interior of supp^-^o) and whose two sides are parallel 
to the tangent X\ = sx 2 to the discontinuity curve at (xi,x 2 ) = (0,0). For 
an illustration, we refer to Figure SJ We now split the coefficients | (/, i/jj,k,o) I 
into the part 'inside the parallelogram' and 'outside' of it exploiting the 
shearing property of shearlets for the second part, and obtain 

l(/,feo>l = \(Xvf^j,k,o)\ + \(Xvof(S s -)^ jAo )\ (16) 

where k = k + 2^ 2 s. From now on, we assume that k < with k = k + 2^ 2 s. 
The case k + 2^ 2 s > can be handled similarly. 

Let us start by estimating the first term | (xvfi ^j,k,o) I m dlS]) stated as 

Claim 1. 



(1 + \s 



2\3/2 



First, notice that the tangent to the edge curve (E(x 2 ), x 2 ) (or (x\, E{xi))) 
is given by X\ = sx 2 . We now assume that the edge curve is contained in a 
set {x £ M 2 : x\ > sx 2 }, and just remark that the general case can be han- 
dled similarly. Let now d be the length of that side of V, which is a part of 
the tangent X\ = sx 2 . We observe that d is in fact the distance between two 
points, in which the tangent X\ = sx 2 intersects the boundary of supp^^o)- 
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Figure 4: A shearlet ipj,k,o intersecting the edge curve x\ = E{x<i) or X2 = E(xi). The 
right hand side shows a magnification of the parallelogram V . 



For an illustration we wish to refer to Figure HI From this observation, it 
follows that 

+ | s |2 



d 



\k\ 



We now let d be the height of V. Since the edge curve can be parameterized 
by a C 2 function E with bounded curvature, 

d<c i^ /2 VTTW 



\k\ 

Summarizing, the volume of V can be estimated as 

;i + | s | 2 ) 3 / 2 



\v\<c- 



2 3 J/2 . |Af 



This implies 



/i i_ I „|2\3/2 (i , |o|2N3/2 



23J/2 . |fc|3 2 3 J'/4 . |jfe|3 ' 



and Claim 1, i.e., estimate ( 1T7|) . is proved. 
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Next, we estimate the second term, i.e., \ (xv c f(S s -), , ipjj {0 )\, in (TTBj) stated 

as 

Claim 2. 

\M(s,-U jk0 )\<c-(i + \ s \r- + (is) 

Notice that S^V entirely contains the edge curve of f(S s -) in the interior 
of supp(^> - £ ) and that the boundary of the parallelogram S^V consists of 
two vertical line segments in the interior of supp(^ ■ £ ) (see Figured]). By 
translation symmetry, this implies that for proving Claim 2, it suffices to 
estimate 

(MS.')xn,il> jt k,o), 

where Q = {(x 1 ,x 2 ) G M 2 : X\ > 0}, f G C 2 (IR 2 ) compactly supported in 
[0, l] 2 and X]| a |<2 ll-^ a /o||oo < 1- We wish to mention that the consideration 
of the case x% > - compare the definition of the set Q - is by no means 
restrictive, since the case X\ < can be handled in a similar way. 




Figure 5: A shear let ipj % o intersecting the parallelogram Sj l V. The right hand side 
shows a magnification of the parallelogram S^V. 

Now, again by translation symmetry, we may assume that <9(supp ip- £ ) 
intersects the origin. In particular, we may translate supp(y>.£ ) defined in 
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(IT5|) so that we might now consider 

supp(^ j0 ) + (-2^ 1 ,0) 

as the support of ip- . We refer the reader to Figure |6] for an illustration. 
This implies that there is one side of the boundary 9(supp^.^ ), which is a 
part of the line 

£ = {{xi, x 2 ) G M 2 : x 2 = (-2 j/2 /k) ■ xi} 

with slope —2^ 2 /k, as described in Figure El Applying the Taylor expansion 
for fo(S s -) at each point x = (xi,x 2 ) G C, we obtain 



f 2J/2 \ ( 
fo(S a x) = a{x\) + b(xi) \x 2 + — ■ xij + c(x 1 ,x 2 ) \ x 2 + 



k 



+ 



where a(xi),b(xi) and c(xi,x 2 ) are all bounded in absolute value by C(l 
|s|) 2 . This implies (compare also an illustration of the area of integration in 
Figure EJ) 



2H 2 



K/o(S , *-)Xn,^ j jE >0 )l = 

JO j-^ 
< C- (1 + |s|) 2 



23 I 



-i, 3 
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fo(S8X)ip j jj. tQ (x) dx 2 dxi 
3 

y^Jijxi) dxi 



(19) 



where 



h(xi) 
h(x 2 ) 
h(x 2 ) 



2-3/2 



T I3 (^j,fc,o( X l' X 2)J dx 



-3/2 



-i/2 



£2 ■ Tp yl) jt k fi {xi,X2)) dx$ 



with being the translation operator defined by Tp(f) = /(• — /3) and 
/3 G R being chosen to be /3 = (0, [2^1 2 fk) ■ Xl ). 
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Figure 6: A shearlet ip. g intersecting the edge curve xi = such that supp(^> ■ g ) 
intersects the positive £2 axis and <9(supp ?/> . g ) intersects the origin. The illustration 
also highlights the integration area for (|T9| . 



We first estimate 7i(a?i). We observe that, since 

{(xi, x 2 ) e R 2 : *P j ; k>0 (xi,x 2 ) ^ 0} C [0, 2- J ' /2 /|fc|] for a fixed 2* > 0, 

WLOG, for any x x > 0, the interval [0, 2~ j / 2 /\k\} for the range of the inte- 
gration in I\{x\) can be replaced by M (see also Figure [6]). Therefore, we 
have 



^ j ' kQ (x 1 ,x 2 )dx 2 



Now 



l^,o(6,6)| = 2" 3 ^ ■ |^(2-^i,2-^6 - 2-^1)1- 



and hence, by hypothesis (i) from Theorem II .31 

fe,o(6,0)|< 2^. |2-^(2-^i)| -\kr. 
By (ED]) and (ED, it follows that 



Ii(xi) < C 



2?/* 



for some C > 0. 



(20) 



(21) 



(22) 
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Next, we estimate ^(^i)- We have 



x 2 ■ ^ j Xo(x 1 ,x 2 )dx 2 
5i + S 2 . 



2i/2 
+ — ■ 

1*1 



^,0(^1^2)^2 



To estimate Si, observe that, by hypothesis (ii) from Theorem II .3[ 

d 



Si 



1 
2^ 
1 



By (122!) and the fact that < x\ < the second term S 2 can be estimated 



(23) 



as 



OJ'/2 \ 9- 7 '/ 4 , 2 jf / 4 (7 

5 2 < C ■ I \. — <C- (2^/ 2 ) ■ 4- < 



\k\ ' -y 

Concluding from (1231 and (I24I) . we obtain 

J 2 (xi) <^i + 5 2 < 



|fc|T 2^/ 4 -|fc|^ 



(24) 



C 



2^/4 . |jfc|7 



(25) 



Finally, we estimate Iz{xi). For this, notice that 2 3j ' 4 T^(^. ^ (^i, ^2)) 
is bounded, hence 



C?X2 



/ 3 (x!)<2f -C 
Summarizing, by (fT9l), (|22l), (fgol), and (121 

|(/o(^-)xn,^, )| < C-(1 + | S |) 2 - 
< C-(l + \s\) 2 - 
and Claim 2, i.e., estimate (ITS]) , is proved. 



< 



C 



2V ■ 



(26) 



H / i 7 - 



+ -T- 



2^-|ife| 3 j 
1 1 



dx\ 



2^'/ 4 . |A;| 3 2H-\k\ 2 



2(3 



From Claim 1 and 2, i.e., from (jT7J) and (ITS]) , we conclude that 



|(/,feo)| < c (i + | S |) 2 (-3--- + -r—r- ) + (i + l*l 2 ) 3/2 



23 J ' • |fc| 3 2* J ' • |A;| 2 ^ ' v ' ' ' 2* J ' • \k\* 

This implies (jl]) and (J3J except for the case s = oo, which we will study now. 

Finally, we consider the case s = oo, i.e., we assume that the edge curve 
is parameterized by (x±, E(xi)) in the interior of Sj tP such that E G C 2 
and E'(x\) = 0. As before, let / G £ 2 (v), and select V to be the smallest 
parallelogram which entirely contains the discontinuity curve in the interior 
of supp^-^o) and whose two sides are parallel to X2 = 0. Similarly as before, 
we consider 

I (/, $j,k,o) I = I (Xvf, ipj,k,o) I + I (xv°f, ipj,k,o) I • (27) 

and estimate both terms on the RHS separately. 

We first consider the term | (xvfi V'j.Jfc.o) I • Using similar arguments as 
before, which we decided not to include in detail to avoid repetitions, one 
can prove that 

\{Xvf,1>j,k fi )\ <C2-K (28) 

Turning our attention to the second term \ {xr c f,' l Pj,k,o)\, we first observe 
that V entirely contains the edge curve of / in the interior of supp^-^o) 
and that the boundary of the parallelogram V consists of two horizontal line 
segments in the interior of supp^-^o) (see Figure [7]). By translation sym- 
metry, this implies that for the second term in (j2"7|) . i.e., for | {xv°fi ^fc.o) 1 1 
it suffices to estimate |(/oXoj V'j.fe.o)!' wnere ^ = {{ x i^ x i) G M 2 : x 2 > 0} and 
fo G C 2 (R 2 ) compactly supported in [0, l] 2 with E| a |< 2 \\D a fo\\oc < 1. As 
before, the consideration of the case x 2 > - compare the definition of the set 
Q - is by no means restrictive, since the case x 2 < can be handled in a sim- 
ilar way. Observe that the (horizontal) vanishing moment condition follows 
from condition (i) in Theorem 11.31 Let us briefly think about this: Letting £ 2 
be fixed, condition (i) immediately implies V>(0, £2) = 0. Then, by applying 
Taylor expansion, it follows that $(£1, £2) = ^(0, £2) ■ £1 + CK£ 2 )> and, again 

by condition (i), we have as £1 -> 0, hence ^(0,£ 2 ) = 0. This 

procedure can now be continued. Especially, condition (i) from Theorem 11.31 
implies 

/ x[ ■ ^(x u x 2 )dx 1 = for all x 2 G M and £ = 0, 1. (29) 
Jr 
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Further, we utilize that the shearing operation Sk preserves vanishing mo- 
ments along the x\ axis. That is, 

/ x{ ■ t/j(Sk(xx,X2) T )dxi = for all k, %2 G K and £ = 0, 1. 
Jr 

This can be seen as follows. For i = 0, 1 and for a fixed x% G M, the function 
x\ i— > (xi — fca^ is a polynomial of degree less than or equal to hence, by 
condition ( )29|) on the number of vanishing moments on -0, we have 




X2)dx\ = for all k G R. 



2-i/a 



Figure 7: A shcarlet ^ g intersecting the edge curve x 2 = such that supp(^. g ) 
intersects the positive x\ axis and <9(supp ip^ f, ) intersects the origin. The illustration 
also highlights the integration area for (|30| . 

Employing Taylor expansion and integration (compare Figure [7]) similar 
to the proof of the previous case, we finally obtain 

\(foXn,^,k,o)\ <C-2 3 ^. ^ f x\dx x dx 2 <C-2-^' A (30) 

JO J ~2-i 

By (l28i) and ( 1301) . we obtain (jSJ) when s = oo. This completes the proof. 
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